Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context dependency
نویسندگان
چکیده
It is well-known that HMMs only of the basic structure cannot capture the correlations among successive frames adequately. In our previous work, to solve this problem, segmental unit HMMs were introduced and their e ectiveness was shown. And the integration of cepstrum and cepstrum into the segmental unit HMMs was also found to improve the recognition performance in the work. In this paper, we investigated further re nements of the models by using a mixture of PDFs and/or context dependency, where, for a given syllable, only a preceding vowel was treated as the context information. Recognition experiments showed that the accuracy rate was improved by 23 %, which clearly indicates the e ectiveness of the renements examined in this paper. The proposed syllablebased HMM outperformed a triphone model.
منابع مشابه
Self-organization in mixture densities of HMM based speech recognition
In this paper experiments are presented to apply Self-Organizing Map (SOM) and Learning Vector Quantization (LVQ) for training mixture density hidden Markov models (HMMs) in automatic speech recognition. The decoding of spoken words into text is made using speaker dependent, but vocabulary and context independent phoneme HMMs. Each HMM has a set of states and the output density of each state is...
متن کاملImproved parameter tying for efficient acoustic model evaluation in large vocabulary continuous speech recognition
In an HMM based large vocabulary continuous speech recognition system, the evaluation of context dependent acoustic models is very time consuming. In Semi-Continuous HMMs, a state is modelled as a mixture of elementary generally gaussian probability density functions. Observation probability calculations of these states can be made faster by reducing the size of the mixture of gaussians used to...
متن کاملDiscrete-Mixture HMMs-based Approach for Noisy Speech Recognition
It is well known that the application of hidden Markov models (HMMs) has led to a dramatic increase of the performance of automatic speech recognition in the 1980s and from that time onwards. In particular, large vocabulary continuous speech recognition (LVCSR) could be realized by using a recognition unit such as phones. A variety of speech characteristics can be modelled by using HMMs effecti...
متن کاملSemi-continuous segmental probability model for speech signals
A semi-continuous segmental probability model, which can be considered as a special form of continuous mixture segmental probability model with continuous output probability density functions sharing in a mixture Gaussian density codebook, is proposed in this paper. The amount of training data required, as well as the computational complexity of the semi-continuous segmental probability model(S...
متن کاملUsing the self-organizing map to speed up the probability density estimation for speech recognition with mixture density HMMs
This paper presents methods to improve the probability density estimation in hidden Markov models for phoneme recognition by exploiting the Self-Organizing Map (SOM) algorithm. The advantage of using the SOM is based on the created approximative topology between the mixture densities by training the Gaussian mean vectors used as the kernel centers by the SOM algorithm. The topology makes the ne...
متن کامل